Corpus: lav_newscrawl_2016_1M

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 55501 p-
2 48095 n-
3 44763 s-
4 37230 a-
5 34786 i-
Top Character Bigrams
word rank frequency n-gram
1 29427 ne-
2 18531 pa-
3 16115 sa-
4 15324 iz-
5 14858 no-
Top Character Trigrams
word rank frequency n-gram
1 9088 pie-
2 6203 pār-
3 4570 nep-
4 4109 aiz-
5 3489 nes-
Top Character 4-Grams
word rank frequency n-gram
1 1837 neiz-
2 1687 nesa-
3 1658 nepa-
4 1609 neno-
5 1550 nepi-
Top Character 5-Grams
word rank frequency n-gram
1 1370 nepie-
2 887 priek-
3 838 pamat-
4 751 inter-
5 677 cilvē-
3989 msec needed at 2018-03-15 06:25